PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID XP_009106347.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Brassiceae; Brassica
Family Trihelix
Protein Properties Length: 551 aa    MW: 62218.7 Da    pI: 5.6757
Description Trihelix family protein
Gene Model
Gene Model ID   Type    Source  Coding Sequence
XP_009106347.1  genome  NCBI    View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  89.5   3.7e-28  58     142  1          87
        trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                     rW++qe+ aL+++r++m+ ++r++++k+plWeevs+km e g+ r++k+Ckek+en+ k++k++keg+ ++++++  t+++fdqlea
  XP_009106347.1  58 RWPRQETVALLKIRSDMGIAFRDASAKGPLWEEVSRKMGELGYIRNAKKCKEKFENVYKYHKRTKEGRTGKSEGK--TYRFFDQLEA 142
                     8********************************************************************975544..6*******85 PP

2    trihelix  103.1  2.2e-32  366    451  1          87
        trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                     rW+k e+ aLi++r+++++++ ++  k+plWee+s+ mr+ gf+r++k+Ckekwen+nk++kk+ke++kkr +++s+tcpyf+ql+a
  XP_009106347.1 366 RWPKVEIEALIKLRTNLDSKYLENGPKGPLWEEISAGMRRLGFNRNSKRCKEKWENINKYFKKVKESNKKR-PQDSKTCPYFHQLDA 451
                     8*********************************************************************8.99***********85 PP

Protein Features
Database         Entry ID          E-value   Start  End  InterPro ID  Description
SMART            SM00717           0.059     55     117  IPR001005    SANT/Myb domain
CDD              cd12203           1.94E-21  57     122  No hit       No description
PROSITE profile  PS50090           6.876     57     115  IPR017877    Myb-like domain
Pfam             PF13837           3.6E-17   57     143  No hit       No description
PROSITE profile  PS50090           7.201     359    423  IPR017877    Myb-like domain
SMART            SM00717           0.0022    363    425  IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  5.6E-4    364    422  IPR009057    Homeodomain-like
CDD              cd12203           1.07E-25  365    430  No hit       No description
Pfam             PF13837           2.9E-21   365    452  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 551 aa
MEFGGGTTTT TSAPAEAPPP PQSNDAAAAT EAAAAAATVG AFEVSEEMND RGGFGGNRWP  60
RQETVALLKI RSDMGIAFRD ASAKGPLWEE VSRKMGELGY IRNAKKCKEK FENVYKYHKR  120
TKEGRTGKSE GKTYRFFDQL EALETHHQPQ TQPPPLRPHN NNSSMFSTPP PVTTTIIPPT  180
TTPSFPNISG DFMSDNSTSS SSSYSTSSDV DIGGGGRNKK KRKRKWKEFF ERLMKQVVDK  240
QEELQRQFLE AVEKRERERM AREESWRAQE IARINREREI LAQERSMSAA KDAAVMAFLQ  300
KFSEKPNPQG QPQPQPQPQV NNNNNQQTSQ TPQPPPPPLP QPTLDTAKTD NGDQIMTTPA  360
SASSSRWPKV EIEALIKLRT NLDSKYLENG PKGPLWEEIS AGMRRLGFNR NSKRCKEKWE  420
NINKYFKKVK ESNKKRPQDS KTCPYFHQLD ALYRERNKFQ TTTTNNNVAS SSSTKPDNSV  480
PLMVQPEQQW PPAATVSQAD HHPAQPLDQN YDDEEGTDEE DYDDEEEDEE NEEEEGEFEL  540
VPSNDNKTNN V
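
The numbered, space-separated block above is a display layout rather than a raw sequence string. A minimal Python sketch, assuming the block has been copied as plain text, strips the spacing and position counters and compares the result with the stated length and the two trihelix hits. The function name `parse_sequence_block` is hypothetical and not part of PlantTFDB. The slices assume the domain coordinates are 1-based and inclusive, which is consistent with the alignments shown in the Signature Domain section.

```python
import re

# Sequence block copied verbatim from the record: 10-residue groups with a
# running position counter at the end of each full line.
SEQUENCE_BLOCK = """
MEFGGGTTTT TSAPAEAPPP PQSNDAAAAT EAAAAAATVG AFEVSEEMND RGGFGGNRWP  60
RQETVALLKI RSDMGIAFRD ASAKGPLWEE VSRKMGELGY IRNAKKCKEK FENVYKYHKR  120
TKEGRTGKSE GKTYRFFDQL EALETHHQPQ TQPPPLRPHN NNSSMFSTPP PVTTTIIPPT  180
TTPSFPNISG DFMSDNSTSS SSSYSTSSDV DIGGGGRNKK KRKRKWKEFF ERLMKQVVDK  240
QEELQRQFLE AVEKRERERM AREESWRAQE IARINREREI LAQERSMSAA KDAAVMAFLQ  300
KFSEKPNPQG QPQPQPQPQV NNNNNQQTSQ TPQPPPPPLP QPTLDTAKTD NGDQIMTTPA  360
SASSSRWPKV EIEALIKLRT NLDSKYLENG PKGPLWEEIS AGMRRLGFNR NSKRCKEKWE  420
NINKYFKKVK ESNKKRPQDS KTCPYFHQLD ALYRERNKFQ TTTTNNNVAS SSSTKPDNSV  480
PLMVQPEQQW PPAATVSQAD HHPAQPLDQN YDDEEGTDEE DYDDEEEDEE NEEEEGEFEL  540
VPSNDNKTNN V
"""


def parse_sequence_block(text: str) -> str:
    """Drop whitespace and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", text).upper()


seq = parse_sequence_block(SEQUENCE_BLOCK)

# Should agree with the "Length: 551 aa" field.
print(len(seq))

# Assumption: domain coordinates are 1-based and inclusive, so a hit
# spanning Start..End corresponds to seq[Start - 1:End] in Python.
domain_1 = seq[57:142]   # first trihelix hit, 58-142
domain_2 = seq[365:451]  # second trihelix hit, 366-451
print(domain_1)
print(domain_2)
```

Both slices begin with the `RWP` motif that opens each alignment row for XP_009106347.1, which is a quick sanity check that the block was copied without gaps.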
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    218    224  KKKRKRK
2    219    224  KKRKRK
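
Comparing the two entries with the protein sequence suggests that the Start and End columns here are 0-based, inclusive offsets, unlike the 1-based coordinates used for the signature domains. This is an inference from the data shown, not a documented PlantTFDB convention. A minimal Python sketch of the check follows, using only residues 181-240 of the protein sequence; the names `FRAGMENT`, `OFFSET`, and `hits` are illustrative.

```python
# Residues 181-240 of the protein sequence, copied from the record
# with the group spacing removed.
FRAGMENT = "TTPSFPNISGDFMSDNSTSSSSSYSTSSDVDIGGGGRNKKKRKRKWKEFFERLMKQVVDK"
OFFSET = 180  # number of residues that precede the fragment

hits = {}
for motif in ("KKKRKRK", "KKRKRK"):
    start0 = OFFSET + FRAGMENT.find(motif)  # 0-based index of first residue
    end0 = start0 + len(motif) - 1          # 0-based index of last residue
    hits[motif] = (start0, end0)
    # The 1-based, inclusive equivalents are printed alongside for comparison.
    print(motif, start0, end0, start0 + 1, end0 + 1)
```

The 0-based values are the ones that reproduce the table, so adding 1 to both columns gives positions comparable with the domain tables.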
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AC079283  1e-108   AC079283.4 Arabidopsis thaliana chromosome 1 BAC F7O12 genomic sequence, complete sequence.
GenBank  CP002684  1e-108   CP002684.1 Arabidopsis thaliana chromosome 1 sequence.
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_009106347.1  0.0      PREDICTED: trihelix transcription factor GT-2-like
Swissprot  Q39117          1e-143   TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBL     M4DGU0          0.0      M4DGU0_BRARP; Uncharacterized protein
STRING     Bra015716.1-P   0.0      (Brassica rapa)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MalvidsOGEM48492553
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G76880.1  1e-176   Trihelix family protein